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These  guidelines  are  supplemented  by  other  OCR  documents  for  character  shapes,  char¬ 
acter  positioning,  inks,  papers,  and  forms  design. 

The  material  contained  in  these  guidelines  originally  appeared  in  American  National  Stan¬ 
dard  Character  Set  and  Print  Quality  for  Optical  Character  Recognition  (OCR-A),  ANSI 
X3 .17-1977.  In  1981,  ANSI  X3. 17-1977  was  revised  and  became  a  standard  for  OCR-A 
character  shapes  and  size  only  (American  National  Standard  Character  Set  for  Optical 
Character  Recognition  (OCR-A),  ANSI  X3 .17-1981  ).  The  print  quality  information  that 
was  deleted  in  the  revision  is  now  provided  in  these  guidelines.  No  new  quality  require¬ 
ments  have  been  added. 

This  document,  while  based  on  OCR-A  and  OCR-B,  can  serve  as  a  print  quality  guide¬ 
line  for  other  constant-stroke-width  machine-readable  fonts. 

The  terms,  uppercase  and  lowercase,  are  widely  used  and  accepted  in  the  United  States 
and  have  been  used  in  these  guidelines  in  place  of  the  International  Standards  Organiza¬ 
tion  terms,  letter  and  small  letter.  In  this  document,  these  terms  have  the  same  meaning 
and  can  be  used  interchangeably. 

Suggestions  for  improvement  of  this  standard  will  be  welcome.  They  should  be  sent  to 
the  Computer  and  Business  Equipment  Manufacturers  Association,  31 1  First  Street,  NW, 
Suite  500,  Washington,  D.C.  20001 . 

This  standard  was  processed  and  approved  for  submittal  to  ANSI  by  American  National 
Standards  Committee  on  Information  Processing  Systems,  X3.  Committee  approval  of  the 
standard  does  not  necessarily  imply  that  all  committee  members  voted  for  its  approval. 

At  the  time  it  approved  this  standard,  the  X3  Committee  had  the  following  members: 

Edward  Lohse,  Chair 

Catherine  A.  Kachurik,  Administrative  Secretary 
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1 .  Scope,  Purpose,  and  Application 

1.1  Scope.  These  guidelines  describe  the  print  quality 
parameters  and  measuring  techniques  for  determining 
the  quality  of  machine-printed  characters  to  maximize 
the  likelihood  that  they  can  be  read  by  electro-optical 
means. 

1.2  Purpose.  The  purpose  of  these  guidelines  is  to  es¬ 
tablish  procedures  for  determining  suitability  of 
machine-printed  characters  to  be  read  optically. 

1.3  Application.  Because  of  the  variety  of  OCR  appli¬ 
cations,  these  guidelines  may  not  include  all  of  the 
necessary  print  quality  specifications  or  considerations 
for  successful  OCR  reading.  Items  such  as  size  of  char¬ 
acter  set,  document  size,  mechanical  properties  of  the 
paper,  and  format  details  should  be  resolved  by  con¬ 
sultation  between  users,  suppliers,  and  OCR  manufac¬ 
turers.  Although  each  parameter  may  be  independently 
specified,  reading  performance  may  deteriorate  if  the 
limits  of  two  or  more  parameters  are  approached  simul¬ 
taneously.  Every  effort  should  be  made  to  keep  well 
within  the  limits.  It  is  recognized,  however,  that  in 
bulk  printing  from  any  device,  the  limits  will  oc¬ 
casionally  and  randomly  be  exceeded. 


2.  Related  Standards 

2.1  American  National  Standards.  These  guidelines 
are  intended  to  be  used  in  conjunction  with  the  follow¬ 
ing  American  National  Standards: 

ANSI  X3 .17-1981 ,  Character  Set  for  Optical  Character 
Recognition  (OCR-A) 

ANSI  X3 .49-1975  (R1982)  Character  Set  for  Optical 
Character  Recognition  (OCR-B) 

ANSI  X3 .62-1979,  Paper  Used  in  Optical  Character 
Recognition  (OCR)  Systems 

ANSI  X3 .86-1980,  Optical  Character  Recognition 
(OCR)  Inks 


ANSI  X3 .9314-1981 ,  Optical  Character  Recognition 
(OCR)  Character  Positioning 

2.2  Other  Publications.  These  guidelines  are  intended 
to  be  used  in  conjunction  with  the  following  publica¬ 
tions: 

ASTM  E  308-66  ( 1981),  Standard  Practice  for  Spectro¬ 
photometry  and  Description  of  Color  in  CIE  1931  Sys¬ 
tem1 

ASTM  F  149-76  ( 1980),  Standard  Definitions  of  Terms 
Relating  to  Optical  Character  Recognition1 

X3/TR-5-82,  Design  of  Optical  Character  Recognition 
Forms  (Technical  Report)2 

3.  Definitions  and  Characteristics  of  the 
Printed  Image 

3.1  General.  The  performance  of  OCR  systems  de¬ 
pends  to  a  large  extent  on  the  optical  and  dimensional 
properties  of  the  printed  image.  For  machine  recogni¬ 
tion  of  printed  information,  the  paper  shall  provide 
high  reflectance  and  printed  images  shall  be  sufficient¬ 
ly  low  reflectance  to  provide  adequate  contrast  in  the 
spectral  range  of  interest. 

All  printed  devices  are  not  capable  of  producing 
images  to  the  same  tolerance  ranges,  and  all  readers  are 
not  capable  of  reliably  recognizing  characters  in  all  tol¬ 
erance  ranges.  In  general,  the  performance  level  of  an 
OCR  reader  will  depend  upon  the  number  of  characters 
to  be  recognized  and  the  ability  of  the  reader  to  tol¬ 
erate  a  range  of  stroke  widths,  contrasts,  voids,  spots, 
and  character  position  variations.  Considering  these 
factors,  three  ranges  of  print  quality  have  been  defined 
in  3.2.  They  describe  the  wide  ranges  of  printing  pro¬ 
duced  by  various  devices  and  processes  and  aid  in 


1  Available  from  ASTM,  1916  Race  Street,  Philadelphia,  PA 
19103. 

2  Available  from  the  Computer  and  Business  Equipment  Manu¬ 
facturers  Association,  311  First  Street,  NW,  Suite  500,  Washing¬ 
ton,  DC  20001. 
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classifying  the  capability  of  the  associated  reader. 

It  should  be  emphasized  that  reliable  recognition  of 
any  specific  character  is  related  to  the  number  of  other 
characters  in  the  character  set  to  which  the  specific 
character  must  be  compared.  Thus,  when  using  limited 
or  partial  character  sets,  such  as  in  numeric-only  appli¬ 
cations,  fully  satisfactory  performance  can  be  achieved 
at  a  lower  level  of  print  quality.  Equivalent  perfor¬ 
mance  levels  using  a  larger  character  set  generally  re¬ 
quire  a  higher  level  of  print  quality. 

To  achieve  and  maintain  the  required  print  quality 
for  OCR,  special  precautions  are  necessary.  These  in¬ 
clude  judicious  selection  of  supplies,  more  frequent 
adjustment  and  maintenance  of  printing  equipment, 
and  frequent  changing  of  ribbons  or  checking  of  inks 
or  other  marking  supplies. 

This  document  contains  definitions  and  informa¬ 
tion  pertaining  to  the  print  quality  characteristics  of 
individual  printed  images  without  consideration  of: 

( 1 )  Their  position  on  the  document 

(2)  Their  spatial  relationship  to  other  character 
images 

(3)  Their  relationship  to  any  other  nonOCR  printing 
on  the  document 

In  many  cases,  the  dimensional  parameters  can  be 
determined  visually  using  gages  and  magnified  images. 
However,  reflectance  values,  spectral  response,  etc,  re¬ 
quire  special  types  of  instruments  to  perform  the  mea¬ 
surements. 

3.2  Print  Quality  Parameters  and  Tolerance  Ranges. 

The  quality  of  a  printed  image  is  established  from  a 
combination  of  measurable  optical  and  dimensional 
parameters.  Important  print  quality  parameters  include: 

( 1 )  Stroke-width  variation 

(2)  Conformance  to  the  character  shape  specified 

(3)  Reflectance  within  the  character  outline 

(4)  Reflectance  outside  the  character  outline 

(5 )  Voids,  spots,  or  extraneous  ink  areas 

In  any  practical  system,  some  tolerances  must  be 
permitted.  The  more  restrictive  the  specified  tolerance 
for  each  variable,  the  more  likely  it  is  that  the  print 
quality  of  the  character  will  fall  within  the  required 
range.  To  accommodate  the  variations  in  capability  of 
various  printing  and  reading  devices,  three  ranges  of 
print  quality  are  recognized: 

(T)  Range  x.  Defines  high-quality  printing.  This 
range  places  a  greater  burden  on  the  printing  or  ink¬ 
ing  devices  and  a  lesser  burden  on  the  reading  device. 

(2 )  Range  y.  Defines  medium-quality  printing.  In 
an  OCR  system  using  this  print  quality  range,  the 
burden  is  more  equally  shared  by  the  printing  and  the 
reading  devices. 

(3)  Range  z.  Defines  low-quality  printing.  This 


range  places  a  lesser  burden  on  the  printing  device  and 
a  greater  burden  on  the  reading  device. 

3.3  Definitions.  OCR  characters  are  designed  to  ulti¬ 
mately  be  read  by  machines.  The  scanning  portion 
translates  the  optical  and  dimensional  properties  to 
electrical  signals.  The  parameters  in  this  section  are  de¬ 
fined  in  terms  that  are  easily  relatable  to  signals  that  a 
typical  OCR  scanner  would  produce  while  scanning  the 
printing.  The  typical  scanner  is  assumed  to  have  a 
scanning  aperture  with  a  diameter  of  0.2  mm  (0.008  in). 
The  limits  given  in  these  guidelines  are  based  upon  this 
size  aperture.  The  limits  for  scanners  with  apertures  of 
a  different  size  must  be  extrapolated  to  determine  the 
equivalent  response. 

The  reflectance  values  are  normalized  to  the  docu¬ 
ment  background  reflectance  (see  discussion  of  print 
contrast  signal  [PCS]  ,  3.3.1 ).  Using  the  PCS  ratio  pro¬ 
vides  the  advantage  of  allowing  OCR  readers  to  be  self¬ 
calibrating.  PCS  can  be  converted  to  reflectance  ratios 
if  the  background  reflectance  is  known. 

For  additional  definitions,  see  ASTM  F  149-76  (1980). 

3.3.1  Print  Contrast  Signal  (PCS).  Print  contrast 
signal  (PCS)  is  an  expression  that  shows  the  relation¬ 
ship  of  the  contrast  between  any  selected  point  of 
interest  within  a  character  outline  and  the  background. 
The  PCS  value  of  a  point  (p)  is  defined  by  the  equation: 


where 

Rw  -  the  maximum  reflectance  found  within  the 
area  of  interest  to  which  the  PCS  of  point  p 
is  referenced.  In  measuring  printed  images, 
this  area  of  interest  shall  be  a  rectangle  ap¬ 
proximately  twice  the  nominal  character  cen¬ 
terline  height  (H)  by  twice  the  nominal  char¬ 
acter  centerline  width  (IP).  The  rectangle  shall 
be  centered  vertically  on  the  character  being 
measured  and  the  bottom  shall  be  located 
HI 2  below  the  character  base  line  (see  Figure 
1). 

Rp  -  the  reflectance  from  a  measured  area  centered 
on  the  point  p. 

The  reflectance  values  R M,  and  Rp  are  measured 
within  a  circular  area  (aperture)  having  a  diameter  of 
0.2  mm  (0.008  in). 

These  reflectance  specifications  deal  only  with  dif¬ 
fuse  reflectance.  Fight  that  is  specularly  reflected  shall 
be  excluded.  Reflectance  values  are  normally  refer¬ 
enced  to  barium  sulfate  or  magnesium  oxide.  Absolute 
values  are  not  necessary  in  the  determination  of  PCS. 
The  value  of  PCS  depends  only  on  the  relative  re¬ 
flectance  values  of  Rw  and  Rp. 
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Figure  1 

Illustration  of  Reflectance,  Rw  and  R 
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Reflectance  may  be  determined  either  by  spectro- 
photometric  measurements,  or  by  a  number  of  re¬ 
flectance  measurements  in  different  spectral  bands 
in  accordance  with  ASTM  E  308-66  ( 1981 ).  This  stan¬ 
dard  describes  the  equipment  used  for  these  measure¬ 
ments  and  the  CIE  color  system. 

The  reflectance  measurements  shall  be  made  using 
the  infinite  pad  method  (see  3.3.2)  and  in  the  spectral 
response  range  of  the  system  that  is  to  read  the  data 
(see  3.4). 

3.3.2  Infinite  Pad  Method.  The  infinite  pad  method 
for  measuring  paper  reflectance  is  the  method  in  which 
the  sample  paper  being  measured  is  backed  with  enough 
thicknesses  of  the  same  type  of  paper  so  that  doubling 
the  number  of  sheets  does  not  change  the  measured 
value  of  reflectance. 

3.3.3  Character  Outline  Limit  (COL).  The  dimen¬ 
sional  properties  of  the  printed  image  are  measured  by 
comparing  the  image  with  the  character  outline  limit 
(COL)  requirements.  The  COL  is  unique  to  each  char¬ 
acter  in  the  set  and  to  each  specified  stroke-width  tol¬ 
erance  range.  When  constructed  on  a  transparency  for 
visual  use,  the  COL  is  called  a  COL  gage. 

The  COL  requirements  include  definition  of  both 
minimum  and  maximum  character  outline  limits.  An 
example  of  a  COL  gage  is  shown  in  Ligure  2.  Except 
for  the  supplemental  rules  noted  below,  the  COL  is 
constructed,  in  all  cases,  from  the  nominal-stroke 
centerline  graphic-shape  outline  drawing  of  the  char¬ 
acters.  The  minimum  or  maximum  COL  may  be  con¬ 
sidered  as  being  generated  by  a  circle,  with  a  diameter 
equal  to  the  minimum  or  maximum  stroke  width  of 
the  respective  character  size  and  tolerance  range,  fol¬ 
lowing  the  character  centerline. 

The  supplemental  rules  for  modifying  all  circle¬ 
generated  COL  gages  are  as  follows: 

(1 )  The  maximum  COL  at  the  free  end  of  a  stroke 
shall  be  squared  off  by  ending  the  maximum  stroke- 
width  outline  one-half  the  maximum  stroke  width 
beyond  the  end  of  the  centerline.  The  ends  of  the 
outline  shall  be  joined  by  lines  perpendicular  to  those 
extended  stroke  outlines.  The  resultant  corners  are  to 
be  constructed  as  sharp  as  possible  within  the  limits  of 
the  construction  method.  In  no  case  are  these  corners 
to  be  rounded  off  with  more  than  a  radius  of  0.10  mm 
(0.004  in). 

(2)  All  internal  corners  on  the  maximum  COL  shall 
be  rounded  with  a  radius  of  0.10  mm  (0.004  in),  ex¬ 
cept  in  the  case  of  a  character  that  has  two  internal 
corners  closer  than  0.20  mm  (0.008  in),  in  which  case 
the  largest  possible  radius  shall  be  used  (for  example, 
the  numbers  1  and  8  and  z  stroke-width  tolerance). 

(3)  When  an  OCR  font  shape  standard  specifies  an 
external  sharp  corner,  both  the  maximum  and  mini- 


RULE  1 


(0.004  in)  RADIUS  MAX] 


Figure  2 
COL  Gage 


mum  COL  outline  shall  be  constructed  with  these 
corners  sharp.  The  limits  of  COL  construction  methods 
may  require  some  rounding  of  sharp  corners.  In  no  case 
shall  these  sharp  corners  be  rounded  off  with  more 
than  a  radius  0.10  mm  (0.004  in). 

(4)  Whenever  any  rounding  of  sharp  corners  due  to 
COL  gage  construction  limitations  causes  a  portion  of 
the  character  to  violate  the  COL,  the  requirements  of 
the  font  shape  standard  and  these  supplemental  rules 
shall  prevail. 

It  should  be  noted  that  having  established  the  shape 
of  the  COL  gage,  it  is  not  always  possible  to  overlay 
the  transparent  COL  gage  on  the  exact  printed  char¬ 
acter  centerline  because  the  centerline  is  not  visible  in 
tire  printed  character.  Because  of  this  difficulty,  the 
measured  character  is  considered  to  be  of  the  correct 
shape  if  the  gage  can  be  placed  over  the  character  in 
any  position  such  that  the  edge  requirements  of  Sec¬ 
tion  4  are  satisfied. 

3.3.4  Print  Contrast  Signal  Degradation.  Print  con¬ 
trast  signal  degradation  is  the  result  of  poor  inking  with 
particular  problems  caused  by  the  effects  of  voids  and 
spots. 

Voids  are  areas  inside  the  minimum  COL  where  the 
printing  process  did  not  deposit  sufficient  material  on 
the  background  area  and  which  have  significantly  higher 
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reflectance  than  other  areas  of  the  printed  image.  (For 
example,  see  Figure  3(a).) 

Spots  are  areas  outside  the  printed  character  with 
significantly  lower  reflectance  than  the  surrounding 
background  area.  They  are  caused  by  dirt  in  the  paper 
or  smudging  or  splattering  of  ink  during  the  printing 
process.  (For  examples,  see  Figure  3(a).) 

Voids  and  spots  are  specified  with  two  objectives 
in  mind:  ( 1 )  to  allow  physical  measurement  of  the 
printed  characteristics  of  a  character  and  (2)  to  antici¬ 
pate  the  scanner  signal  resulting  from  the  actual  scan¬ 
ning  of  the  degraded  character.  (See  Figures  3  and  4.) 

The  physical  and  scanning  parameters  of  voids  and 
spots  shall  be  described  as  follows: 

(1)  The  physical  parameters  are  expressed  in  terms 
of  size,  shape,  separation,  and  reflectance  level,  as  mea¬ 
sured  by  a  scanning  aperture  of  0.2  mm  (0.008  in). 

(See  Figure  4.) 

(2)  The  scanning  parameters  are  expressed  in  terms 
of  PCSvold ,  PCSspot,  PCSmin ,  and  PCSmax ,  which 
are  measures  of  the  signal  degradation  related  to  voids 
and  spots. 

3.3.4. 1  Physical  Voids.  Physical  voids  are  de¬ 
fined  as  areas  within  the  minimum  COL,  which  can  be 
identified  as  being  lighter  than  a  predetermined  PCS 
level.  This  PCS  level  is  designated  the  Void  Threshold 
Level.  Some  voids  are  allowed,  with  the  distinction  be¬ 
tween  allowable  and  nonallowable  voids  being  based 
on  their  size,  separation,  and  PCS  value  (see  Figure  4). 
All  measurements  are  made  with  an  aperture  of  0.2 
mm  (0.008  in). 

The  criteria  for  the  size,  separation,  and  threshold 
level  of  physical  voids  are  described  in  3. 3 .4.1 .1 
through  3.3 .4.1 .3. 

3. 3.4. 1.1  Void  Size.  Small  voids  shall  be  al¬ 
lowed  if  the  distance  (q)  that  the  measuring  aperture 
can  be  moved  with  a  PCS  equal  to  or  less  than  the 
specified  Void  Threshold  Level  nowhere  exceeds  the 
distance  given  below  in  (1),  (2),  or  (3).  The  movement 
shall  be  in  a  straight  line  and  shall  at  all  times  remain 
inside  the  minimum  COL.  For  any  particular  void, 
movement  of  the  aperture  shall  be  in  only  a  single 
selected  direction,  i.e.,  a  void  may  be  somewhat  ob¬ 
long,  but  may  exceed  0.2  mm  (0.008  in)  in  only  a 
single  direction. 

( 1 )  Range  x  Quality,  q  =  0.  No  movement  of  aper¬ 
ture,  i.e.,  only  individual  points  are  permitted. 

(2)  Range y  Quality,  q  <  0.1  mm  (0.004  in). 

(3)  Range  z  Quality,  q  <  0.2  mm  (0.008  in). 

3.3 .4. 1.2  Void  Separation.  The  separation 
between  any  two  allowable  voids  (that  is  where  the 
PCS  value  is  equal  to  or  less  than  the  specified  Void 
Threshold  Level)  shall  exceed  1 .0  mm  (0.040  in) 
center  to  center.  Therefore,  a  small  void  that  is  al¬ 


lowable  under  the  void  size  criteria  shall  become 
unallowable  if  it  is  within  1 .0  mm  (0.040  in)  of  any 
other  void.  (See  Figure  4.) 

3.3.4. 1.3  Void  Threshold  Level.  For  instru¬ 
mented  measurement  of  physical  voids,  a  PCS  level 
shall  be  chosen  such  that  an  area  within  the  COL  with 
a  measured  PCS  below  this  level  is  considered  to  be  a 
candidate  for  being  a  void  (subject  to  the  size  and 
separation  criteria).  This  level  should  be  consistent 
with  the  quality  of  printing  desired  and  compatible 
with  the  capability  of  the  OCR  reader  to  be  used  to 
read  the  printing.  A  high  threshold  level  is  more  re¬ 
strictive  of  the  printer  while  a  lower  threshold  allows 
larger  and  lighter  voids.  (See  Figure  4.) 

3. 3. 4. 2  Physical  Spots.  Physical  spots  are  de¬ 
fined  as  areas  outside  the  maximum  COL,  but  within 
the  area  of  interest,  which  can  be  identified  as  being 
darker  than  a  predetermined  PCS  level.  This  level  is 
designated  the  Spot  Threshold  Level.  Some  spots  are 
allowed,  with  the  distinction  between  allowable  and 
nonallowable  spots  being  based  on  their  size,  separa¬ 
tion,  and  PCS  levels.  Spots  may  be  freestanding  or  may 
be  connected  to  parts  of  the  printed  image.  All  mea¬ 
surements  are  made  with  an  aperture  of  0.2  mm 
(0.008  in).  (See  Figure  3  and  Figure  4.) 

The  criteria  for  the  size,  separation,  and  threshold 
level  of  physical  spots  are  described  in  3.3.4.2.1 
through  3. 3 .4. 2 .3. 

3.3.4. 2.1  Spot  Size.  The  distance  (q)  that 
the  PCS  measuring  aperture  can  be  moved  in  a  straight 
line  with  a  PCS  equal  to  or  more  than  the  Spot 
Threshold  Level  shall  nowhere  exceed  the  distances 
given  in  3.3 .4.1 .1  for  voids.  The  measuring  aperture 
shall  remain  at  all  times  outside  the  maximum  COL. 

For  any  particular  spot,  movement  of  the  aperture 

is  permitted  in  only  a  single  selected  direction.  (Thus, 
a  spot  may  be  somewhat  oblong  but  may  exceed  0.2 
mm  [0.008  in]  in  only  a  single  direction.) 

3. 3.4. 2. 2  Spot  Separation.  The  separation 
distance,  center  to  center,  between  any  two  allowable 
spots  (that  is,  where  the  PCS  value  is  equal  to  or  greater 
than  the  specified  Spot  Threshold  Level  but  is  within 
the  size  limits)  shall  exceed  1 .0  mm  (0.040  in)  from 
the  nearest  spot. 

3.3.4. 2.3  Spot  Threshold  Level.  For  instru¬ 
mented  measurement  of  physical  spots,  a  PCS  level 
shall  be  chosen  such  that  an  area  outside  the  COL  with 
a  measured  PCS  above  this  level  is  considered  to  be  a 
candidate  for  being  a  spot  (subject  to  the  size  and 
separation  criteria).  This  level  should  be  consistent 
with  the  quality  of  printing  desired  and  the  capability 
of  the  OCR  reader  to  discriminate  against  spots.  A 
lower  threshold  is  more  restrictive  of  the  printer  while 
a  higher  threshold  level  allows  larger  and  darker  spots. 

1 1 
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(a)  Section  of  a  Character  Stroke  Illustrating  Voids  and  Spots 


(b)  Signal  Resulting  from  a  Scan  along  Centerline  (a-a) 
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(c)  Signal  Resulting  from  a  Scan  along  (b-b) 
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(d)  Signal  Resulting  from  a  Scan  along  (c-c) 

Figure  3 

Character  Stroke  Degradation  and  the  Resulting  Test  Signal  Device 


12 


AMERICAN  NATIONAL  STANDARD  X3. 99-1  983 


Figure  4 

Physical  Void  and  Spot  Violation  in  Terms  of  Scan  Signal 


3. 3.4. 3  Scanning  Signal  Parameters  —  PCSVOid/ 

pr*\ 

*  '-‘3spot 

3. 3.4. 3.1  PCSvoid.  PCSvojd  is  defined  as  the 
maximum  Void  Threshold  Level  at  which  all  voids  are 
allowable.  For  example,  PCSVOjd  represents  the  highest 
level  to  which  the  Void  Threshold  Level  may  be  ad¬ 
justed  and  at  which  no  void  will  be  detected  using  the 
criteria  of  3.3 .4. 1 ,  i.e exceeds  the  size  and  separation 
limit  for  that  range  of  quality  (see  Figure  5).  It  is  a  mea¬ 
sure  of  the  void  free  signal  that  is  available  to  the  op¬ 
tical  reader. 

3. 3.4. 3. 2  PCSspot.  PCSspot  is  defined  as  the 
minimum  Spot  Threshold  Level  at  which  all  spots  are 
allowable,  i.e.,  PCSspot  represents  the  lowest  level  to 
which  the  Spot  Threshold  Level  may  be  adjusted  and  at 
which  no  spot  will  be  detected  using  the  criteria  of 

3. 3. 4 .2  (see  Figure  6).  It  is  a  measure  of  the  spot  signal 
that  the  optical  reader  must  overcome  to  be  able  to 
recognize  characters  reliably. 

The  ratio  PCSVOjd  and  PCSspot  is  a  measure  of  the 
signal-to-noise  ratio  of  the  printed  image.  To  reliably 
separate  the  character  from  the  background  noise,  the 
ratio  of  PCSvoid/PCSspot  should  be  high. 

3. 3. 4. 4  Contrast  Variation  Ratio.  The  variation 
of  contrast  within  a  character  is  expressed  by  the  Con¬ 
trast  Variation  Ratio  (CVR),  defined  as: 


CVR  = 


PCSr 


PCS 

X  V.  in 


where 

PCSm  jn  =  the  minimum  PCS,  which  is  the  lowest 
PCS  level  that  is  not  exceeded  for  a  scan¬ 
ning  distance  of  0.20  mm  (0.008  in) 
within  the  minimum  COL.  (See  Figure  5.) 

PCSmax  =  the  maximum  PCS,  which  is  the  highest 
PCS  level  that  is  continuously  exceeded 
for  a  scanning  distance  of  0.20  mm 
(0.008  in)  within  the  maximum  COL. 

(See  Figure  5.) 

NOTE:  For  range  z  quality,  PCSvo,d  and  PCSmjn  are  the  same 
by  definition  if  PCSvojd  is  determined  by  the  void  size  criteria 
and  not  by  the  separation,  i.e.,  the  distance  the  aperture  can  be 
moved  is  0.20  mm  (0.008  in)  in  both  cases. 

3.4  Spectral  Requirements  for  Measurement  of  PCS. 

Inks  intended  to  be  read  by  OCR  systems  require  a 
high  level  of  light  absorption  compared  to  the  back¬ 
ground,  which  is  usually  white  paper.  As  a  result,  they 
produce  a  high  PCS  value  (PCS  =  ( Rw  -Rp)/Rw). 
Carbon-pigmented  inks  absorb  light  over  a  broad  spec¬ 
tral  range.  However,  dye-based  inks  are  more  spectrally 
reflective  and  thus  must  be  selected  with  care  to  pro¬ 
duce  high  PCS  values  over  ranges  of  interest.  For  the 
most  part,  dye-based  inks  are  nonread  inks  for  OCR 
systems. 

The  spectral  response  characteristics  of  OCR  systems 
are  dependent  on  the  illumination  source  and  the  sen¬ 
sors.  As  a  result,  OCR  reading  devices  are  designed  to 
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Figure  5 

Measured  PCSy^,  PCSm  jn ,  and  PCSmax  for 
x,  v ,  and  z  Ranges  from  the  Same  Stroke 


Figure  6 

PCSspot  for x,  y,  and  z  Range  Printing 
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Table  1 

Nominal  Character  Sizes  and  Stroke-Width  Tolerances 


Size 

Nominal  Centerline 

Normal 

Stroke 

Width  ( T) 

Stroke-Width  Tolerances 

* 

Height  (If) 

Width  (W) 

Rang 

0  X 

Rani 

»e  y 

Ran; 

tic  z 

mm 

in 

mm 

in 

m  m 

in 

m  m 

in 

mm 

in 

m  m 

in 

1 

2.40 

0.094 

1.40 

0.055 

0.36 

0.014 

±0.08 

±0.003 

+  0.15 

+  0.006 

±0.20 

±0.008 

III 

3.20 

0.126 

1.52 

0.060 

0.38 

0.015 

±0.08 

±0.003 

±0.18 

±0.007 

+0.28 

+0.01  1 

-0.18 

-0.007 

IV 

3.81 

0.150 

2.03 

0.080 

0.51 

0.020 

±0.13 

±0.005 

±0.25 

+  0.010 

+0.41 

+0.016 

-0.25 

-0.010 

*Range  tolerance  does  not  apply  to  the  Character  Lrase  symbol. 

NOTES: 

(1)  The  height  and  width  of  OCR-B  characters  differ  slightly  from  character  to  character.  However,  the  largest  OCR-B  characters 
are  no  larger  than  the  values  shown;  therefore,  all  requirements  specified  in  this  standard  are  applicable. 

(2)  When  using  lowercase  letters  of  Size  I  in  the  OCR  set,  the  range  x  tolerance  should  be  applicable  to  all  characters  in  the  set. 

(3)  The  lowercase  letters  i.  j,  in,  p,  and  w  of  OCR-A  were  designed  to  exceed  the  nominal  H  or  W  values  given  in  this  table  because 
of  their  unique  characteristics. 

(4)  The  dimensions  in  the  metric  and  the  inch-pound  systems  given  in  this  standard  are  not  precisely  equivalent.  I  or  consistency, 
type  designers  may  adopt  either  system  but  they  should  not  be  intermixed. 

(5)  Character  Erase,  Group  Erase,  Character  Space,  and  Long  Vertical  Mark  (LVM)  were  designed  to  exceed  the  nominal  H  or  W 
values  given  in  this  table  because  of  their  unique  characteristics. 

(6)  Size  designation  refers  to  ANSI  OCR-A.  as  defined  in  ANSI  X3. 17-1981 .  and  ANSI  OCR-B,  as  defined  in  ANSI  X3. 49-1982. 
The  Size  II  designation  is  reserved  for  international  use. 


have  different  spectral  responses  in  the  ultraviolet,  the 
visible,  or  the  near  infrared  regions.  Carbon-pigmented 
inks  should  be  considered  as  read  inks  on  all  OCR 
equipment.  However,  certain  printing  devices  may 
not  use  carbon-pigmented  inks  and  then  readability 
may  not  be  reliable  in  all  spectral  regions.  Nonread 
inks  can  be  formulated  with  a  low  absorption  in  the 
spectral  range  where  the  reader  is  most  sensitive.  At 
the  same  time,  these  OCR  nonread  inks  can  be  very 
readable  by  humans. 

The  exact  selection  and  formulation  of  these  non¬ 
read  inks  should  be  determined  by  both  the  user  and 
OCR  equipment  manufacturer  for  optimum  results. 

The  spectral  response  of  instruments  to  be  used  to 
measure  and  determine  PCS  should  also  be  negotiated 
by  the  instrument  manufacturer  and  the  OCR  equip¬ 
ment  manufacturer  to  ensure  compatibility.  The  fol¬ 
lowing  spectral  bands  have  been  used  to  measure  PCS: 

(1)  Near  ultraviolet 

(2)  Visual 

(3)  Infrared 

See  ANSI  X3 .86-1980  for  a  further  description  of  OCR 
read  and  nonread  inks. 

4.  Measurement  of  Character  Geometry 

4. 1  General.  The  conformance  of  a  printed  image  to 
its  shape  specification  may  be  measured  using  COL 
gages.  COL  gages  are  designed  to  conform  to  the  char¬ 


acter  set  size  as  specified  by  the  appropriate  shape 
standard  and  tolerance  range. 

Table  1  shows  the  nominal  centerline  character 
height  and  width,  the  nominal  stroke  width  for  char¬ 
acters,  and  the  stroke-width  tolerance  values  for  qual¬ 
ity  ranges  ,xr,  y,  and  z.  Each  print  quality  range  specifies 
the  maximum  permitted  variation  between  the  nominal 
stroke  width  and  the  actual  stroke  width  of  any  char¬ 
acter. 

The  COLs  which  are  drawn  using  the  values  from 
Table  1  are  normally  displayed  on  a  1:1  scale  for  direct 
overlay  on  the  printed  character  and  virtually  enlarged 
only  by  the  magnification  factor  of  a  hand-held  magni¬ 
fier.  For  critical  applications,  20: 1  and  even  50: 1  COL 
enlargements  are  recommended  to  help  determine  char¬ 
acter  shape  and  edge  irregularity  conditions.  COL  gages 
do  not  define  all  the  print  quality  parameters;  they 
provide  a  simplified  means  for  an  approximate  evalua¬ 
tion  of  the  printed  image  shape  quality  for  OCR  char¬ 
acters  used  for  data  entry  systems. 

4.2  Determination  of  Stroke  Edges.  To  determine  if 
a  printed  image  (character  under  test  )  is  within  speci¬ 
fied  tolerance  limits,  it  is  first  necessary  to  establish 
the  location  of  the  stroke  edge  and  then  to  determine 
if  these  stroke  edges  comply  with  the  requirements  re¬ 
lating  to  the  character  geometry.  The  location  of  the 
stroke  edges  of  a  character  under  test  can  be  deter¬ 
mined  by  instrumented  measurements  as  follows  or 
approximately  as  described  in  Section  6. 
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ALLOWABLE  EDGE 

IRREGULARITY  STROKE  EDGE 


IRREGULARITY 


MAX  COL 

MIN  COL 


MIN  COL 


MAX  COL 


Figure  7 

Enlarged  View  of  Stroke  Variation 


Compliance  with  character  geometry  requirements 
depends  on  the  ability  to  define  the  edge  of  the  strokes. 
Stroke  edges  produced  by  high-quality  printers  (i.e., 
print  quality  range  x)  are  relatively  easy  to  identify 
visually.  Instrumented  methods  should  give  results 
reasonably  consistent  with  results  observed  visually. 
When  stroke  edges  are  not  distinct  due  to  smudges, 
nonuniform  density,  or  shading  at  the  edges,  etc,  the 
instrumented  method  may  be  necessary,  and  it  should 
be  closely  related  to  the  scanning  and  recognition  sys¬ 
tems  to  be  used.  Stroke  edges  are  defined  as  the  loci 
of  points  defined  by  the  center  of  the  scanning  aper¬ 
ture  when  tracking  around  the  character,  using  a 
threshold  value  of  1/2  PCSavg.  PCSavg  is  the  arithmetic 
average  value  of  PCS  measured  over  the  darkest  80%  of 
the  character  within  the  minimum  COL. 

4.3  Determination  of  Character  Geometry  for  Print 
Quality  Ranges.  If  any  single  position  of  the  COL  gage 
can  be  found  where  all  strokes  of  the  character  defined 
by  the  stroke  edges  as  determined  above  fill  the  mini¬ 
mum  COL  and,  at  the  same  time,  do  not  extend  beyond 
the  boundaries  of  the  maximum  COL,  then  the  char¬ 
acter  falls  within  the  given  print  quality  range.  Small 
discrepancies,  which  are  subject  to  further  measure¬ 
ment  as  voids  or  spots,  or  both,  should  be  ignored 


when  making  this  measurement.  However,  under  no 
circumstances,  even  if  allowable  under  the  definition 
of  spots  or  voids  given  in  Section  2,  shall  a  small  edge 
irregularity  cross  over  the  COL  line  for  a  distance  ex¬ 
ceeding  the  value  d  as  measured  along  the  COL  line 
(see  Figure  7).  The  value  of  d  is: 


Print 

Quality 

Range 

Distance  (d) 

mm 

in 

X 

0.30 

0.012 

y 

0.40 

0.016 

z 

0.50 

0.020 

The  distance  from  any  allowable  edge  irregularity 
to  the  nearest  other  allowable  edge  irregularity  should 
be  at  least  1 .0  mm  (0.040  in)  measured  center  to  center 
along  the  COL  line. 


5.  Measurement  of  Inking  Quality 

5.1  General.  This  section  gives  recommended  reflect¬ 
ance  values  in  terms  of  PCS  or  CVR  of  printed  char¬ 
acters  and  describes  methods  for  laboratory  evaluation 
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of  the  optical  characteristics  that  affect  print  quality. 

The  techniques  for  PCS  and  CVR  determination  re¬ 
quire  a  significant  number  of  measurements  using 
sophisticated  equipment,  ranging  from  optical  reflectom- 
eters  to  computer  systems.  This  section  describes 
measuring  methods  using  such  equipment.  Section  6 
describes  simplified  print  quality  evaluation  methods 
that  can  be  used  in  the  field. 

5.2  Contrast  Variation  Ratio  (CVR)  of  a  Character. 

For  reliable  reading,  it  is  essential  that  the  character 
area  have  a  higher  PCS  value  than  the  background. 
Reliability  of  reading  may  also  be  decreased  as  the 
unevenness  or  CVR  of  the  ink  density  within  the 
printed  images  is  increased.  For  the  tolerance  ranges, 
the  major  portion  of  the  character  image  within  the 
minimum  COL  (normally  interpreted  as  the  measure¬ 
ment  made  along  the  character  stroke  virtual  center- 
lines)  should  have  PCS  values  greater  and  CVR  values 
lower  than  shown  in  Table  2. 

A  higher  PCSmjn  value,  and  conversely  a  lower 
CVR  reading,  result  in  improved  OCR  reader  per¬ 
formance. 

5.3  Voids.  For  the  measurement  of  physical  voids, 
it  is  recommended  that  the  chosen  Void  Threshold 
Level  be  equal  to  or  greater  than  0.3  PCS.  An  area 
with  a  PCSvoid  equal  to  or  less  than  0.3  is  a  physical 
void. 

5.4  Spots.  It  is  recommended  that  the  chosen  Spot 
Threshold  Level  be  less  than  0.3  PCS  and  also  less  than 
0.3  of  the  Void  Threshold  Level.  To  reliably  separate  the 
character  from  the  background  noise,  the  following 
limits  of  this  ratio  shall  be  observed: 

PCSv0ld 

ppo - >  1  -5  for  range  x 

PCSvoid 

57^ - >  1 .4  for  range  y 

PCSvoid 

- >  1 .3  for  range  z 

i'-,oSpot 


6.  Approximate  Methods  of  Visually  Evaluating 
Print  Quality 

6.1  General.  In  many  cases,  except  for  print  contrast, 
OCR  quality  levels  can  be  established  with  acceptable 
accuracy  without  having  to  employ  sophisticated 
equipment.  Stroke  edges  produced  by  high-quality 
printers  (i.e .,  x  tolerance  range)  are  relatively  easy 
to  identify  visually  for  most  applications,  and  visual 
methods  give  results  reasonably  consistent  with  re- 


Table  2 

PCS  and  CVR  Values  for  Printed  Characters 


Range 

Percentage  of 
Measured 

PCS 

Equal  to 
or  Greater 
Than 

Remaining 

PC'S 

1  v  °min 

CVR 

X 

90% 

0.70 

>0.47 

<1.50 

y 

85% 

0.50 

>0.35 

<1.75 

Z 

80% 

0.35 

>0.30 

<2.00 

NOTES: 

(1)  These  values  represent  minimum  requirements  and  every 
effort  should  be  made  to  exceed  them. 

(2)  Within  a  line  or  field  of  printed  characters,  PCS  values 
should  be  as  uniform  as  possible.  This  is  of  particular  im¬ 
portance  where  more  than  one  printing  method  is  used  to  gen¬ 
erate  these  characters. 


suits  obtained  using  reflectometers.  When  stroke  edges 
are  not  distinct  due  to  smudges,  nonuniform  density, 
or  shading  at  the  edges,  etc,  the  instrumented  method 
may  be  required  and  is  recommended. 

6.2  Visual  Determination  of  Character  Outline  Limit. 

Frequently,  a  COL  gage  is  used  to  determine  visually 
if  the  printed  character  meets  the  outline  requirements 
as  indicated  in  the  shape  standard.  Most  commonly, 
the  COL  gage  is  merely  a  transparency  depicting  the 
specified  maximum  and  minimum  character  outlines. 

In  practice,  the  transparency  is  laid  over  the  projected 
character  image  to  determine  its  geometry  relative  to 
the  specified  shape.  For  critical  applications,  20: 1 
and  even  50: 1  COL  enlargements  are  used  with  a  pre- 
cison  optical  comparator  to  help  determine  character 
shape  and  edge  irregularity  conditions. 

In  constructing  the  COL  gage  character  outlines, 
the  instructions  given  in  3.2.2  should  be  followed. 

It  should  be  noted  that  having  established  the  shape 
of  the  COL  gage,  it  is  not  always  possible  to  overlay 
the  transparent  COL  gage  on  the  exact  printed  char¬ 
acter  centerline  because  the  centerline  is  not  visible 
in  the  printed  character.  Because  of  this  difficulty, 
the  measured  character  is  considered  to  be  of  the  cor¬ 
rect  shape  if  the  gage  can  be  placed  over  the  character 
in  any  position  such  that  the  edge  requirements  of 
Section  4  are  satisfied. 

6.3  Visual  Identification  of  Allowable  Voids.  After 
the  position  for  overlaying  the  COL  gage  on  the  char¬ 
acter  is  determined,3  a  void  is  any  area  within  the  mini¬ 
mum  COL  that  is  devoid  of  ink. 


3  When  measuring  voids  in  characters  that  do  not  meet  the 
stroke  edge  requirements,  the  COL  gage  is  positioned  to  mini¬ 
mize  the  edge  overlap  of  the  COL. 
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A  void  is  allowable  if  it  satisfies  the  following  con- 
ditons: 

( 1 )  It  can  be  contained  entirely  within  the  areas 
defined  below: 

(a)  Range  x.  A  circle  with  a  diameter  of  0.2 
mm  (0.008  in),  estimated  visually. 

(b)  Range y.  An  area  described  by  moving  a  cir¬ 
cle  with  a  diameter  of  0.2  mm  (0.008  in)  along  a 
straight  line  0.1  mm  (0.004  in)  or  less,  estimated  visual¬ 
ly- 

(c)  Range  z .  An  area  described  by  moving  a 
circle  with  a  diameter  of  0.2  mm  (0.008  in)  along 
a  straight  line  0.2  mm  (0.008  in)  or  less,  estimated 
visually. 

(2)  The  separation  between  any  two  allowable  voids 
that  exceeds  one-third  the  area  of  an  inspection  circle 
that  is  0.2  mm  (0.008  in)  in  diameter  shall  be  at  least 

1 .0  mm  (0.040  in)  (see  Figure  8). 

(3)  Small  voids  or  groups  of  small  voids  having  a 
total  area  less  than  1/3  the  area  of  an  inspection  cir¬ 
cle  that  is  0.2  mm  (0.008  in)  in  diameter  are  allowed 
in  unlimited  number  (see  Figure  8). 

6.4  Visual  Identification  of  Allowable  Spots.  After 
the  best  fit  between  measuring  gage  and  character  is 
determined,  any  extraneous  ink  outside  the  maximum 
COL  is  defined  as  a  spot.  The  sizes  of  spots  determine 
if  they  are  allowable  or  not.  Any  extraneous  ink  of  suf¬ 
ficient  area  and  nearly  as  dark  as  the  lightest  printing 
within  the  minimum  COL  may  cause  reading  difficul¬ 
ties.  Where  there  is  a  substantial  question  of  subjective 
visual  judgment,  the  instrumented  method  based  upon 
PCS  should  be  used. 

A  spot  is  allowable  if  it  satisfies  the  following  con¬ 
ditions: 

(1)  It  can  be  contained  entirely  within  the  areas  as 
defined  below: 

(a)  Range  x.  A  circle  with  a  diameter  of  0.2  mm 
(0.008  in),  estimated  visually. 


(b)  Range  y.  An  area  described  by  moving  a  cir¬ 
cle  with  a  diameter  of  0.2  mm  (0.008  in)  along  a 
straight  line  0.1  mm  (0.004  in)  or  less,  estimated  visually 

(c)  Range  z.  An  area  described  by  moving  a  cir¬ 
cle  with  a  diameter  of  0.2  mm  (0.008  in)  along  a 
straight  line  0.2  mm  (0.008  in)  or  less,  estimated  visually 

(2)  The  separation  between  any  two  allowable  spots 
that  exceeds  one-third  the  area  of  an  inspection  circle 
that  is  0.2  mm  (0.008  in)  in  diameter  shall  be  at  least 
1 .0  mm  (0.040  in)  (see  Figure  9). 

It  is  important  to  note  one  practical  difference  be¬ 
tween  the  visual  and  PCS  methods  of  assessing  spots. 
With  the  visual  method,  the  assessment  depends  only 
on  the  dimensions  and  spacing  of  the  spots;  it  is  inde¬ 
pendent  of  the  density  of  the  associated  character.  In 
PCS  terms,  when  using  the  visual  method  of  determin¬ 
ing  PCS,  a  spot  associated  with  a  dense  character  may 
be  accepted  while  an  identical  spot  associated  with  a 
weak  character  will  be  rejected. 

The  size  and  frequency  of  spots  remote  from  a 
character  should  also  be  strictly  controlled.  Many  read¬ 
ing  operations  are  started  upon  detection  of  the  first 
black  point  observed,  and  if  a  spot  larger  than  0.2 
mm  (0.008  in)  occurs  in  any  direction,  the  recognition 
process  may  begin.  Spots  larger  than  0.2  mm  (0.008  in) 
in  any  direction  should  be  prevented. 

6.5  Other  Approximate  Methods  of  Evaluation.  Other 
less  precise  methods  of  visual  measurement  are  some¬ 
times  used  because  they  are  portable  and  inexpensive. 

For  some  OCR  fonts,  small  magnifiers  (i.e .,  5X  to 
10X)  are  employed.  Representations  of  certain  char¬ 
acter  outlines  and  various  spot  sizes  are  printed  on  a 
reticle  placed  in  contact  with  the  print. 

Scales  with  characters  reproduced  at  various  levels 
of  reflectance  have  also  been  used  for  certain  applica¬ 
tions.  These  allow  for  crude  but  frequently  adequate 
visual  evaluation  of  the  image’s  PCS  level. 
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(3)  Full-circle  diameter  range  2,  A  =  0.4  mm  (0.016  in) 
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Figure  9 
Spots 
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